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The prc&ent invention describes a method to scan gene databases for the simple 
active site motif characteristic of aspartyl proteases, Eukaryotic aspanyl proica^s such as 
pepsia and renin possess a two-domain structure which folds to bring two aspartyl residues 
into proximity within the active site. These are embedded ic Ihc short tnpeptide motif 
0TC, or more rarely^ DSG. Most aspanyj proteases occur as proeazyme whose N-tcrmitvus 
must be deaved for^vzuoa. The DTG or DSG active site motif appears ai about residue 
65-70 in ihe proenzyme ^sfs^cimi^ pepsinogen), but at about residue 25-30 in the active 
enzyme after cleavage of the N-tenninal prodomain. The limited length of the active site 
motif makes it difficult to search coUecuons of short, expressed sequence lags (EST) for 
novel aspartyl proteases. EST sequences typically average 250 nucleotides or less» and so 
would encode 80-90 amino acid residues or less. Thai would be too short a sequence to 
span the two acuve site motifs. The preferred method is to scan databases of bypotheticaJ 
or assembled protein coding sequences. The present invention descnbes a computer 
^octbod lo idcnufy candidate aspartyl proteases in protein sequence databases. The method 
was used to identify se^tn candidate aspartyl protease sequences in the Caenorhahdtds 
elcgans genome. These sequences were then used to identify by homology search Hu-Aspl 
and two altemative splice variants of Hu-Asp2, designated herein as Hu-Asp2(a) and Hu- 
Asp2(b). 

In a major aspect of the mvention disclosed here we provide new information about 
APP processing. Pathogeneic ;»-ocessing of the amyloid precursor protein (APP) via the 
Ap pathway requires the sequential action of two proteases referred to as p-secreiase and 7- 
sccreta.^. Cleavage of APP by the p-secretasc and y-secretase generates the N-tcrminus 
and C-termmus of the Ap peptide, respectively. Because over production of the Aj3 
peptide, particularly the A&m2, has been implicated in the initiation of Alzheimer^ s disease, 
inhibitors of either the ^-secretase and/or the •y-.'wcretase have potential in the tieatmcnt of 
Akheimcr's disease. Despite the importance of the P-secretase and 7-secretase in xht 
pathogenic processing of APP, molecular definition of these enzymes has cot been 
accomplished to date That is. it was aoi known what enzymes were required for cleavage 
at Cither the p-secrctasc or the ^^sccrcLasc cleavage sue. The sites themselves were 
known because APP was known and the ApM2, peptide was known, see US 5,766,846 and 
US 5,837,672. (incorporated by reference, with the exception to reference to "soluble" 
pepudcs). But what caryme was involved in producing the ApM2» peptide was unknown. 
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The present invention mvoJvcs the m&lecaisj dcftnirjon of scvcniJ novel human 
aspartyl proteases aad one ©f these, referred to as Hu-Asp-2(a) and Hu- Asp2(b). has been 
characterized in detail. Previous forms of asp I and asp 2 have been disclosed, sec EP 
0S48O62 A2.aad EP 0S5S444A2. inventors David Powel el. a]., assigned to Smith Kline 
Beccham Corp. (incorporated by reference) Herein are disclosed old and nevw fonns of 
Hu-Asp 2. For the first ume they are expressed in active form, their substrates aie 
disclosed, and their specificity is disclosed. Prior to this disclosure cell or ceU extracts were 
required to cleave the p-sccrctase site, now purified protein can be used in assays, also 
described here Based on the results of (1) antisense knock out experiments, (2) transient 
transfection knock in experiments, and (3) btochemicai experiments using purified 
recomhinart Hu-Asp-2, we demonstrate thai Hu-Asp-2 is the p-secretase involved in the 
processing of APP. Although the nucleotide and predicted amino acid sequence of Hu- 
Asp- 2(a) has been reponed. see above, see EP 0848062 A2 and EP 085 5444 A2, no 
funcuoi^al charactcnzation of the enzyme was disclosed, Here the authors charactcnzc the 
Hu-Asp-2 enzyme and arc able to explain why it is a critical and esseiMia! enzyme required 
in the formation of Aflwi, peptide and possible a critical step in the development of AD, 

in another cmbodimcn: the present invention also describes a novel splice variant of 
Hu-Asp2, referred to as Hu-Asp-2Cb}, that has never before been disclosed 

]Ti another embodiment, the invention provides isolated nucleic acid molecules 
comprising a polynacleoade encoding a polypeptide selected from the group consisting of 
human aspartyi protease 1 (Hu-Aspl ) and two alteniaiive splice variants of human aspartyl 
protease 2 (Hu-A£p2). designated herein as Hu-Asp2{a) and Hu-Asp2(b). As used herein, all 
references to "Hu-Asp2" should be understood to refer to both Hu-Asp2Ca) and Hu-Asp2Cb). 
Hu-Asp I is expressed most abundantly in pancreas and prostate tissues, while Hu-Asp2(a) 
and Hu-Asp2(b) arc exprsssed most abundantly in pancreas and brain tissues. The mvention 
also provider isolated Hu-Asp 1, Hu-Asp2Ca), and Hu-A5p2(b) polypeptides, as welJ as 
fragments thereof which exhibit aspartyl protease activity. 

The predicted amino acid sequences of Hu-Aspl, Hu-Asp2(a) aud Hu-Asp2(b) share 
signi/icant homology with previously identiSed mammalian aspart>'i proteases such as 
pepsinogen A, pepsmogcn B, cathepsin D, cathepsin E, and renin. P3.S2ecs, Scand I Clin. 
Lab. Invest, 52:(Suppl. 210 5-22 CI 992)). These enz>TOes are charactenzed by the presence of 
a duplicated DTG/DSG sequence motif. The Hu-Asp I and HuAsp2 polypeptides disclosed 
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herein also exhibit extremely hi^ homology with the ProSiie consensus motif for aspartyl 
proteases extracted iiom tbe SwissProt cSacabase. 

The micleoudc sequence given as residues 1-1554 of SEQ ID N0;1 corresponds to 
the nucletmtk sequence encoding Hu-AspU ibc nucleotide sequence given as residues 1-1503 
of SEQ ID N0.3 corre^nds to the nucleotide scqucricc encoding Hii-Asp2(a), and the 
nucleotide sequence given as residuts 1-1428 of SEQ ID N0:5 corresponds to the nucleotide 
seqiieacc escoding Hu-Asp2(b). The isolation and sequencing of DNA encoding Hu-Aspl* 
Hu-Asp2(a), and Hu-Asp2{b) is described below hi Examples 1 and 2. 

As is described in Examples 1 and 2, auiomaied sequencing methods were used to 
obtain the nucleotide sequence of Hu-AspU Hu-Asp2(a), and Hu-Asi>*2(b). The Hu-Asp 
nucleotide sequences of the present invention were obtained for both DNA strands, and arc 
believed to be 100% accurate. However, as is known in the an* nucleotide sequence obtained 
hy snch automated methodi; may contain some errors. Nucleodde sequences determined by 
auiomataott are typically at least about 90%, more typically at least about 95% to at least about 
99 9% idcnticsl to the actual nucleotide sequence of a given nucleic acid molecule. The 
actual sequence may be more precisely dcteriiuucd using manual sequencing methods, which 
are well known in the an. An error in sequence which results in an insertion or dfiictior: of 
one or more nucleotides may result in a frame shift in translation such thai the predicted 
amino acid sequence will differ from that wfajch would be predicted from the actual 
nucleotide seqiier\C€ of the nucleic acid molecule, staning at the point of the mutation. The 
Hu-Asp DNA of the present mvention includes cDNA, chemically synthesized DNA, DNA 
isolaied by PCR, genomic DNA, and combinations thereof. Genomic Hu-Asp DNA may be 
obtained by screcnin| a gcnomjc hbrary with the Hu-Asp2 cDKA described herein, using 
methods that are well isnown in the an, or with oiigon\;cleoli6es chosen from the Hu-Asp2 
sequence thai will pnme the polymerase chain reaction (PCR). RNA transcribed from Hu- 
Asp DNA is also encompassed by the present invention 

Due lo the degeneracy of the genetic code, two DNA sequences may differ and yet 
encode ideniical amino acid sequences. The present invention thus provides isolated nucleic 
acid molecules haviag a polynucleotide sequence encoding any of the Hi>-A5p polypeptides of 
the inventioru wherein said polynucleotide sequence encodes a Hu-Asp polypeptide having 
the complete amino aad sequence of SEQ ID N0;2, SEQ ED N0:4, SEQ ID N0:6, or 
fragments thereof. 
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